Stabilizing Sparse Cox Model Using Statistic and Semantic Structures in Electronic Medical Records
نویسندگان
چکیده
Stability in clinical prediction models is crucial for transferability between studies, yet has received little attention. The problem is paramount in high dimensional data, which invites sparse models with feature selection capability. We introduce an effective method to stabilize sparse Cox model of time-to-events using statistical and semantic structures inherent in Electronic Medical Records (EMR). Model estimation is stabilized using three feature graphs built from (i) Jaccard similarity among features (ii) aggregation of Jaccard similarity graph and a recently introduced semantic EMR graph (iii) Jaccard similarity among features transferred from a related cohort. Our experiments are conducted on two real world hospital datasets: a heart failure cohort and a diabetes cohort. On two stability measures – the Consistency index and signal-to-noise ration (SNR) – the use of our proposed methods significantly increased feature stability when compared with the baselines.
منابع مشابه
Stabilizing Sparse Cox Model using Clinical Structures in Electronic Medical Records
Stability in clinical prediction models is crucial for transferability between studies, yet has received little attention. The problem is paramount in highdimensional data which invites sparse models with feature selection capability. We introduce an effective method to stabilize sparse Cox model of time-to-events using clinical structures inherent in Electronic Medical Records (EMR). Model est...
متن کاملEvaluation of risk factors of recurrence of hodgkin\'s lymphoma using random survival forest and comparison with cox regression model
Background: In many studies, Cox regression was used to assess the important factors that affect the survival of cancer patients based on demographic and clinical variables. The aim of this study was to determine the factors affecting the survival of patients with Hodgkin's lymphoma using the random survival forest (RSF) method and compare it with the Cox model. Methods: In this retrospective ...
متن کاملAn automated model to identify heart failure patients at risk for 30-day readmission or death using electronic medical record data.
BACKGROUND A real-time electronic predictive model that identifies hospitalized heart failure (HF) patients at high risk for readmission or death may be valuable to clinicians and hospitals who care for these patients. METHODS An automated predictive model for 30-day readmission and death was derived and validated from clinical and nonclinical risk factors present on admission in 1372 HF hosp...
متن کاملIdentification of Factors Affecting Metastatic Gastric Cancer Patients’ Survival Using the Random Survival Forest and Comparison with Cox Regression Model
Background and Objectives: In survival analysis, using the Cox model to determine the effective factors requires the assumptions whose failure of leads to biased results. The aim of this paper was to determine the factors affecting the survival of metastatic gastric cancer patients using the non-parametric method of Randomized Survival Forest (RSF) model and to compare its result with the Cox m...
متن کاملDeterminant factors of survival time in a cohort study on HIV patient using by time-varying cox model: Fars province, south of Iran
Background and aims: The pandemic of AIDS is a global emergency and one of the biggest challenges in social and individual life. This study aimed to evaluate the survival time of HIV patients and its effective factors. Methods: This historical cohort study was conducted on the individuals infected with HIV in Fars province, south of Iran, during 2006 to 2...
متن کامل